SEQUENCE LISTING



B§sUF£WT0 06 JAN 2006 



<110> Anderson, J C
Wu, Ning
Santoro, Stephen
Schultz, Peter G

<- l^T^X^- * N0 — tRNA 

<130> 54-000330PC 

<140> PCT/US 04/022187 
<141> 2004-07-07 

<160> 37 

<170> PatentIn version 3.2

<210> 1
<211> 78
<212> RNA
<213> Pyrococcus abyssi

<400> 1

Cccgguag cuuagccugg uuagagcggc ggacucuuaa uccgcagguc ggggguucaa 60 
aucccccccg gcccgcca 

<210> 2
<211> 78
<212> RNA
<213> Pyrococcus furiosus

<400> 2
gggj'cgguag cuuagccugg uuagagcggc ggacucuuaa uccgcagguc ggggguucaa 60
aucccccccg gcccgcca 78



<210> 3
<211> 78
<212> RNA
<213> Pyrococcus horikoshii

<400> 3
fg^cgguag cuuagccugg uuagagcggc ggacucuuaa uccgcagguc ggggguucaa 60
aucccccccg gcccgcca 78



<210> 4
<211> 78
<212> RNA
<213> Pyrococcus abyssi

<400> 4
Scgguag cucagccugg ucagagcacc gggcuuuuaa cccggugguc gcggguucaa 60
aucccgcccg gcccgcca 78



<210> 5
<211> 78
<212> RNA
<213> Pyrococcus horikoshii

<400> 5
gggfcgguag cucagccugg ucagagcacc gggcuuuuaa cccggugguc gcggguucaa
aucccgcccg gcccgcca 



<210> 6
<211> 78
<212> RNA
<213> Pyrococcus furiosus

<400> 6



gggccgguag cucagccugg uuagagcacc gggcuuuuaa cccggugguc gcggguucaa 
aucccgcccg gcccgcca 

<210> 7
<211> 78
<212> RNA
<213> Pyrobaculum aerophilum

<400> 7

gggJccguag cucagcccgg uuagagcggc gggcuuuuaa cccguagguc guggguucga 
aucccaccgg gcccgcca 

<210> 8
<211> 77
<212> RNA
<213> Thermoplasma acidophilum

<400> 8

gggSccguag cuuagcuagg uagagcgaug gacucuuaau ccauagguca gggguccaaa 
uccccucgga cccgcca 

<210> 9
<211> 77
<212> RNA
<213> Thermoplasma volcanium

<400> 9

ggguccguag cuuagcuagg uagagcgaug gacucuuaau ccauagguca gggguccaaa 
uccccucgga cccgcca 

<210> 10
<211> 77
<212> RNA
<213> Archaeoglobus fulgidus

<400> 10
ggg2cgglag cuuagccagg cagagcgcgg gacucuuaau cccgcagucg gggguucaaa
ucccucccgg cccgcca 

54-000330US.ST25.txt

<210> 11
<211> 77
<212> RNA
<213> Halobacterium sp. NRC-1

<400> 11
gggJcgglag cucagucugg cagagcgacg gacucuuaau ccgucggucg cguguucaaa
ucgcgcccgg cccgcca

<210> 12
<211> 77
<212> RNA
<213> Thermoplasma acidophilum

<400> 12
gggcccguag cucagccagg uagagcaucu ggcuuuuaac caggugguca gggguucgaa
cccccucggg cccgcca 

<210> 13
<211> 77
<212> RNA
<213> Thermoplasma volcanium

<400> 13

ggg?ccgilg cucagccagg uagagcaucu ggcuuuuaac caggugguca gggguucgaa 
cccccucggg cccgcca 

<210> 14
<211> 77
<212> RNA
<213> Methanococcus jannaschii

<400> 14
gggcccguag cucagucugg cagagcgccu ggcuuuuaac cagguggucg aggguucaaa
ucccuucggg cccgcca 

<210> 15
<211> 77
<212> RNA
<213> Methanobacterium thermoautotrophicum

<400> 15

ggg2ccguag cucagucugg cagagcgcuu ggcuuuuaac caaguggucg cggguucaau 
ucccgucggg cccgcca 

<210> 16
<211> 77
<212> RNA
<213> Methanosarcina mazei

<400> 16
gggcccguag cuuagucugg uagagcgccu gacuuuuaau caggcggucg aggguucgaa
ucccuucggg cccgcca 

<210> 17
<211> 77
<212> RNA
<213> Sulfolobus tokodaii

<400> 17
gggcccguag cucagccagg uagagcggcg ggcucuuaac ccguaggucc cggguucaaa
ucccggcggg cccgcca

<210> 18
<211> 77
<212> RNA
<213> Sulfolobus tokodaii

<400> 18

ggg2ccgl!g cucagccagg uagagcggcg ggcuuuuaac ccguaggucc cggguucaaa 
ucccggcggg cccgcca 

<210> 19
<211> 77
<212> RNA
<213> Pyrobaculum aerophilum

<400> 19

gggJccglag cucagccugg uagagcggcg ggcucuuaac ccguaggucg uggguucgaa 
ucccaccggg cccgcca 

<210> 20
<211> 77
<212> RNA
<213> Archaeoglobus fulgidus

<400> 20

ggg?ucguag cucagccagg cagagcgacg ggcuuuuaac ccgucggucg cggguucaaa 
ucccgucgag cccgcca 

<210> 21
<211> 76
<212> RNA
<213> Sulfolobus solfataricus

<400> 21

gggJccgSag cuuagccagg uagagcgacg ggcucuuaac ccguaguccc ggguucgaau 
cccggcgggc ccgcca 

<210> 22
<211> 79
<212> RNA
<213> Aeropyrum pernix

<400> 22

ggg?ccguig cucagccugg uagagcggcg ggcucuuacc ccgcggaagu cccggguuca 
aaucccggcg ggcccgcca 

<210> 23
<211> 80
<212> RNA
<213> Artificial

<220>
<223> consensus tRNA

<220>
<221> misc_feature
<222> (36)..(36)
<223> n indicates no consensus (u or c)

<220>
<221> misc_feature
<222> (43)..(43)
<223> n indicates no consensus (gap or c)

<220>
<221> misc_feature
<222> (44)..(44)
<223> n indicates no consensus (gap or g)

<400> 23

gggcccguag cucagccugg uuagagcggc gggcunuuaa ccnncggagg ucgcggguuc 60 
aaaucccgcc gggcccgcca 

<210> 24
<211> 77
<212> RNA
<213> Artificial

<220>
<223> consensus-derived tRNA akcua

<400> 24

gggcccguag cucagccugg uagagcggcg ggcucuaaac ccgcaggucg cggguucaaa 60 
ucccgccggg cccgcca 

<210> 25
<211> 78
<212> RNA
<213> Artificial

<220>
<223> consensus-derived tRNA library

<220>
<221> misc_feature
<222> (1)..(4)
<223> n is a, c, g or u

<220>
<221> misc_feature
<222> (71)..(74)
<223> n is a, c, g or u

<400> 25
nnnnccguag cucagccugg uagagcggcg ggcuuccuaa cccgcagguc gcggguucaa 60
aucccgccgg nnnngcca 78

<210> 26
<211> 78
<212> RNA
<213> Artificial

<220>
<223> mutant tRNA

<400> 26
uggSccguag cucagccugg uagagcggcg ggcuuccuca cccgcagguc gcggguucaa 60
aucccgccgg acuagcca 78



<210> 27
<211> 1092
<212> DNA
<213> Artificial

<220>
<223> mutant tRNA synthetase

<400> 27
atggttcatt gggccgatta tattgctgat aaaataatta gagagagggg ggagaaggag 60
aagtacgttg ttgagagtgg aataacgcca agtggttacg ttcacgttgg gaactttagg 120
gagcttttta cagcttatat tgtgggccat gccctaaggg ataaggggta tgaggttagg 180
cacatccaca tgtgggatga ttatgataga tttaggaagg ttccaaggaa cgttccccag 240
gaatggaaag attacctggg aatgcccatt agtgaagttc ctgatccctg gggatgccat 300
gagagttatg ctgaacactt catgagaaag ttcgaggagg aggtagaaaa attagggatc 360
gaagttgact ttctttatgc gagtgaactc tacaagagag gggaatattc tgaggagata 420
aggttagcct ttgagaaaag ggataagata atggagatac taaacaagta tagggaaatt 480
gcgaaacaac ctccccttcc agagaactgg tggcccgcaa tggtttactg ccctgagcat 540
aggagggaag cagagatcat tgaatgggat gggggctgga aggttaagta taagtgcccc 600
gaaggtcacg agggatgggt tgatataagg agtgggaacg tgaaactgag gtggcgtgtt 660
gattggccca tgcgttggtc tcactttggc gttgacttcg aacctgctgg aaaggatcat 720
cttgtggctg gttcaagcta cgatacggga aaggagatta taaaggaagt ttatggaaag 780
gaagctccgt tatctttaat gtatgagttt gttggaatta aggggcagaa ggggaagatg 840
agtggtagta agggaaatgt tattttactc agcgatctgt atgaggttct tgagccaggt 900
ctcgttagat ttatctacgc tcggcatagg ccaaacaagg agataaagat agatctaggt 960
cttggcattc taaacctcta cgatgagttc gataaagttg agagaatata cttcggggtt 1020
gagggtggta aaggtgatga tgaagaatta aggaggactt acgagctttc ggtgatgctg 1080
ccaacttact ga 1092

<210> 28
<211> 363
<212> PRT
<213> Artificial

<220>
<223> mutant tRNA synthetase

<400> 28

Met Val His Trp Ala Asp Tyr Ile Ala Asp Lys Ile Ile Arg Glu Arg
1 5 10 15

Gly Glu Lys Glu Lys Tyr Val Val Glu Ser Gly Ile Thr Pro Ser Gly
20 25 30

Tyr Val His Val Gly Asn Phe Arg Glu Leu Phe Thr Ala Tyr Ile Val
35 40 45

Gly His Ala Leu Arg Asp Lys Gly Tyr Glu Val Arg His Ile His Met
50 55 60

Trp Asp Asp Tyr Asp Arg Phe Arg Lys Val Pro Arg Asn Val Pro Gln
65 70 75 80

Glu Trp Lys Asp Tyr Leu Gly Met Pro Ile Ser Glu Val Pro Asp Pro
85 90 95

Trp Gly Cys His Glu Ser Tyr Ala Glu His Phe Met Arg Lys Phe Glu
100 105 110

Glu Glu Val Glu Lys Leu Gly Ile Glu Val Asp Phe Leu Tyr Ala Ser
115 120 125

Glu Leu Tyr Lys Arg Gly Glu Tyr Ser Glu Glu Ile Arg Leu Ala Phe
130 135 140

Glu Lys Arg Asp Lys Ile Met Glu Ile Leu Asn Lys Tyr Arg Glu Ile
145 150 155 160

Ala Lys Gln Pro Pro Leu Pro Glu Asn Trp Trp Pro Ala Met Val Tyr
165 170 175

Cys Pro Glu His Arg Arg Glu Ala Glu Ile Ile Glu Trp Asp Gly Gly
180 185 190

Trp Lys Val Lys Tyr Lys Cys Pro Glu Gly His Glu Gly Trp Val Asp
195 200 205

Ile Arg Ser Gly Asn Val Lys Leu Arg Trp Arg Val Asp Trp Pro Met
210 215 220

Arg Trp Ser His Phe Gly Val Asp Phe Glu Pro Ala Gly Lys Asp His
225 230 235 240

Leu Val Ala Gly Ser Ser Tyr Asp Thr Gly Lys Glu Ile Ile Lys Glu
245 250 255

Val Tyr Gly Lys Glu Ala Pro Leu Ser Leu Met Tyr Glu Phe Val Gly
260 265 270

Ile Lys Gly Gln Lys Gly Lys Met Ser Gly Ser Lys Gly Asn Val Ile
275 280 285

Leu Leu Ser Asp Leu Tyr Glu Val Leu Glu Pro Gly Leu Val Arg Phe
290 295 300

Ile Tyr Ala Arg His Arg Pro Asn Lys Glu Ile Lys Ile Asp Leu Gly
305 310 315 320

Leu Gly Ile Leu Asn Leu Tyr Asp Glu Phe Asp Lys Val Glu Arg Ile
325 330 335

Tyr Phe Gly Val Glu Gly Gly Lys Gly Asp Asp Glu Glu Leu Arg Arg
340 345 350

Thr Tyr Glu Leu Ser Val Met Leu Pro Thr Tyr
355 360

<210> 29
<211> 1572
<212> DNA
<213> Artificial

<220>
<223> mutant tRNA synthetase

<400> 29
atggttcatt gggccgatta tattgctgat aaaataatta gagagagggg ggagaaggag 60
aagtacgttg ttgagagtgg aataacgcca agtggttacg ttcacgttgg gaactttagg 120
gagcttttta cagcttatat tgtgggccat gccctaaggg ataaggggta tgaggttagg 180
cacatccaca tgtgggatga ttatgataga tttaggaagg ttccaaggaa cgttccccag 240
gaatggaaag attacctggg aatgcccatt agtgaagttc ctgatccctg gggatgccat 300
gagagttatg ctgaacactt catgagaaag ttcgaggagg aggtagaaaa attagggatc 360
gaagttgact ttctttatgc gagtgaactc tacaagagag gggaatattc tgaggagata 420
aggttagcct ttgagaaaag ggataagata atggagatac taaacaagta tagggaaatt 480
gcgaaacaac ctccccttcc agagaactgg tggcccgcaa tggtttactg ccctgagcat 540
aggagggaag cagagatcat tgaatgggat gggggctgga aggttaagta taagtgcccc 600
gaaggtcacg agggatgggt tgatataagg agtgggaacg tgaaactgag gtggcgtgtt 660
gattggccca tgcgttggtc tcactttggc gttgacttcg aacctgctgg aaaggatcat 720
cttgtggctg gttcaagcta cgatacggga aaggagatta taaaggaagt ttatggaaag 780
gaagctccgt tatctttaat gtatgagttt gttggaatta aggggcagaa ggggaagatg 840
agtggtagta agggaaatgt tattttactc agcgatctgt atgaggttct tgagccaggt 900
ctcgttagat ttatctacgc tcggcatagg ccaaacaagg agataaagat agatctaggt 960
cttggcattc taaacctcta cgatgagttc gataaagttg agagaatata cttcggggtt 1020
gagggtggta aaggtgatga tgaagaatta aggaggactt acgagctttc aatgcctaag 1080

aagcctgaga gattagtcgc tcaagctcct tttaggttcc tagcggtgtt ggttcagtta 1140 

ccgcatttaa ccgaagaaga cataataaat gttctaatca aacagggaca tattcccagg 1200 

gatctatcca aggaggacgt tgagagggtt aaacttagga taaaccttgc taggaattgg 1260 

gttaaaaagt atgcccctga ggatgttaaa ttctcaatac ttgagaaacc tccagaagtt 1320 

gaggtaagtg gagatgttag ggaggccatg aatgaggttg ctgagtggct tgagaatcat 1380 

gaggaattta gcgttgaaga gtttaataac attctattcg aagttgccaa gaggaggggg 1440 

atatccagta gggagtggtt ttcgacgctc tacagattat ttattggaaa ggaaagggga 1500 

ccgagattgg ccagtttcct ggcatctctt gataggagtt tcgttattaa acgacttaga 1560 

cttgagggat ag 1572 

<210> 30
<211> 523
<212> PRT
<213> Artificial

<220>
<223> mutant tRNA synthetase

<400> 30

Met Val His Trp Ala Asp Tyr Ile Ala Asp Lys Ile Ile Arg Glu Arg
1 5 10 15

Gly Glu Lys Glu Lys Tyr Val Val Glu Ser Gly Ile Thr Pro Ser Gly
20 25 30

Tyr Val His Val Gly Asn Phe Arg Glu Leu Phe Thr Ala Tyr Ile Val
35 40 45

Gly His Ala Leu Arg Asp Lys Gly Tyr Glu Val Arg His Ile His Met
50 55 60

Trp Asp Asp Tyr Asp Arg Phe Arg Lys Val Pro Arg Asn Val Pro Gln
65 70 75 80

Glu Trp Lys Asp Tyr Leu Gly Met Pro Ile Ser Glu Val Pro Asp Pro
85 90 95

Trp Gly Cys His Glu Ser Tyr Ala Glu His Phe Met Arg Lys Phe Glu
100 105 110

Glu Glu Val Glu Lys Leu Gly Ile Glu Val Asp Phe Leu Tyr Ala Ser
115 120 125

Glu Leu Tyr Lys Arg Gly Glu Tyr Ser Glu Glu Ile Arg Leu Ala Phe
130 135 140

Glu Lys Arg Asp Lys Ile Met Glu Ile Leu Asn Lys Tyr Arg Glu Ile
145 150 155 160

Ala Lys Gln Pro Pro Leu Pro Glu Asn Trp Trp Pro Ala Met Val Tyr
165 170 175

Cys Pro Glu His Arg Arg Glu Ala Glu Ile Ile Glu Trp Asp Gly Gly
180 185 190

Trp Lys Val Lys Tyr Lys Cys Pro Glu Gly His Glu Gly Trp Val Asp
195 200 205

Ile Arg Ser Gly Asn Val Lys Leu Arg Trp Arg Val Asp Trp Pro Met
210 215 220

Arg Trp Ser His Phe Gly Val Asp Phe Glu Pro Ala Gly Lys Asp His
225 230 235 240

Leu Val Ala Gly Ser Ser Tyr Asp Thr Gly Lys Glu Ile Ile Lys Glu
245 250 255

Val Tyr Gly Lys Glu Ala Pro Leu Ser Leu Met Tyr Glu Phe Val Gly
260 265 270

Ile Lys Gly Gln Lys Gly Lys Met Ser Gly Ser Lys Gly Asn Val Ile
275 280 285

Leu Leu Ser Asp Leu Tyr Glu Val Leu Glu Pro Gly Leu Val Arg Phe
290 295 300

Ile Tyr Ala Arg His Arg Pro Asn Lys Glu Ile Lys Ile Asp Leu Gly
305 310 315 320

Leu Gly Ile Leu Asn Leu Tyr Asp Glu Phe Asp Lys Val Glu Arg Ile
325 330 335

Tyr Phe Gly Val Glu Gly Gly Lys Gly Asp Asp Glu Glu Leu Arg Arg
340 345 350

Thr Tyr Glu Leu Ser Met Pro Lys Lys Pro Glu Arg Leu Val Ala Gln
355 360 365

Ala Pro Phe Arg Phe Leu Ala Val Leu Val Gln Leu Pro His Leu Thr
370 375 380

Glu Glu Asp Ile Ile Asn Val Leu Ile Lys Gln Gly His Ile Pro Arg
385 390 395 400

Asp Leu Ser Lys Glu Asp Val Glu Arg Val Lys Leu Arg Ile Asn Leu
405 410 415

Ala Arg Asn Trp Val Lys Lys Tyr Ala Pro Glu Asp Val Lys Phe Ser
420 425 430

Ile Leu Glu Lys Pro Pro Glu Val Glu Val Ser Gly Asp Val Arg Glu
435 440 445

Ala Met Asn Glu Val Ala Glu Trp Leu Glu Asn His Glu Glu Phe Ser
450 455 460

Val Glu Glu Phe Asn Asn Ile Leu Phe Glu Val Ala Lys Arg Arg Gly
465 470 475 480

Ile Ser Ser Arg Glu Trp Phe Ser Thr Leu Tyr Arg Leu Phe Ile Gly
485 490 495

Lys Glu Arg Gly Pro Arg Leu Ala Ser Phe Leu Ala Ser Leu Asp Arg
500 505 510

Ser Phe Val Ile Lys Arg Leu Arg Leu Glu Gly
515 520

<210> 31
<211> 4813
<212> DNA
<213> Artificial

<220>
<223> plasmid pACKO-A184TAG

<220>
<221> misc_feature
<222> (3749)..(3749)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (3769)..(3769)
<223> n is a, c, g, or t

<400> 31
gaactccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600
ggcttactat gttggcactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 660
aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 720
actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 780
ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 840
agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 900

agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 960 

tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1020 

gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1080 

tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1140 

gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1200 

agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1260 

tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1320 

cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1380 

aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1440 

atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1500 

atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1560 

aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1620 

tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1680 

atatcggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 1740 
tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 1800
atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 1860
attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 1920
gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 1980
agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 2040
aaagttctgc tatgtggcgc ggtattatcc cgtgttgacg ccgggcaaga gcaactcggt 2100
cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 2160
cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 2220
actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 2280
cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc 2340
ataccaaacg acgagcgtga caccacgatg ccttaggcaa tggcaacaac gttgcgcaaa 2400
ctattaactg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag 2460
gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct 2520
gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat 2580
ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa 2640
cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggca ccaccaccac 2700
caccactaac ccgggaccaa gtttactcat atatacttta gattgattta aaacttcatt 2760
tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 2820
aacggcatgc accattcctt gcggcggcgg tgctcaacgg cctcaaccta ctactgggct 2880
gcttcctaat gcaggagtcg cataagggag agcgtctggc gaaaggggga tgtgctgcaa 2940
ggcgattaag ttgggtaacg ccagggtttt cccagtcacg acgttgtaaa acgacggcca 3000
gtgccaagct taaaaaaaat ccttagcttt cgctaaggat ctgcagttat aatctctttc 3060
taattggctc taaaatcttt ataagttctt cagctacagc attttttaaa tccattggat 3120
gcaattcctt atttttaaat aaactctcta actcctcata gctattaact gtcaaatctc 3180
caccaaattt ttctggcctt tttatggtta aaggatattc aaggaagtat ttagctatct 3240
ccattattgg atttccttca acaactccag ctgggcagta tgctttcttt atcttagccc 3300
taatctcttc tggagagtca tcaacagcta taaaattccc ttttgaagaa ctcatctttc 3360
cttctccatc caaacccgtt aagacagggt tgtgaataca aacaaccttt tttggtaaaa 3420
gctcccttgc taacatgtgt atttttctct gctccatccc tccaactgca acatcaacgc 3480
ctaaataatg aatatcatta acctgcatta ttggatagat aacttcagca acctttggat 3540
tttcatcctc tcttgctata agttccatac tccttcttgc tctttttaag gtagttttta 3600
aagccaatct atagacattc agtgtataat ccttatcaag ctggaattca gcgttacaag 3660
tattacacaa agttttttat gttgagaata tttttttgat ggggcgccac ttatttttga 3720
tcgttcgctc aaagaagcgg cgccagggnt gtttttcttt tcaccagtna gacgggcaac 3780
agaacgccat gagcggcctc atttcttatt ctgagttaca acagtccgca ccgctgtccg 3840
gtagctcctt ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg actgtcttct 3900
ttatcatgca actcgtagga caggtgccgg cagcgcccaa cagtcccccg gccacggggc 3960
ctgccaccat acccacgccg aaacaagcgc cctgcaccat tatgttccgg atctgcatcg 4020
caggatgctg ctggctaccc tgtggaacac ctacatctgt attaacgaag cgctaaccgt 4080
ttttatcagg ctctgggagg cagaataaat gatcatatcg tcaattatta cctccacggg 4140
gagagcctga gcaaactggc ctcaggcatt tgagaagcac acggtcacac tgcttccggt 4200
agtcaataaa ccggtaaacc agcaatagac ataagcggct atttaacgac cctgccctga 4260
accgacgacc gggtcgaatt tgctttcgaa tttctgccat tcatccgctt attatcactt 4320
attcaggcgt agcaccaggc gtttaagggc accaataact gccttaaaaa aattacgccc 4380
cgccctgcca ctcatcgcag tactgttgta attcattaag cattctgccg acatggaagc 4440
catcacagac ggcatgatga acctgaatcg ccagcggcat cagcaccttg tcgccttgcg 4500
tataatattt gcccatggtg aaaacggggg cgaagaagtt gtccatattg gccacgttta 4560
aatcaaaact ggtgaaactc acccagggat tggctgagac gaaaaacata ttctcaataa 4620
accctttagg gaaataggcc aggttttcac cgtaacacgc cacatcttgc gaatatatgt 4680
gtagaaactg ccggaaatcg tcgtggtatt cactccagag cgatgaaaac gtttcagttt 4740
gctcatggaa aacggtgtaa caagggtgaa cactatccca tatcaccagc tcaccgtctt 4800
tcattgccat acg 4813

<210> 32
<211> 4814
<212> DNA
<213> Artificial

<220>
<223> plasmid pACKO-A184AGGA

<220>
<221> misc_feature
<222> (3750)..(3750)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (3770)..(3770)
<223> n is a, c, g, or t

<400> 32
gaactccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60

gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120

ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180

tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240

aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300

ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360

ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420

ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480

gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540

acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600

ggcttactat gttggcactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 660

aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 720

actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 780

ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 840

agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 900

agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 960

tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1020

gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1080

tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1140

gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1200 

agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1260 

tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1320 

cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1380 

aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1440 

atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1500 

atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1560 

aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1620 

tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1680 

atatcggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 1740 

tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 1800 

atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 1860 

attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa 1920 

gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac 1980 

agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt 2040 

aaagttctgc tatgtggcgc ggtattatcc cgtgttgacg ccgggcaaga gcaactcggt 2100 

cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat 2160 

cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac 2220 

actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg 2280 

cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc 2340 

ataccaaacg acgagcgtga caccacgatg cctaggagca atggcaacaa cgttgcgcaa 2400 

actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga 2460 

ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc 2520 

tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga 2580 

tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga 2640 

acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggc accaccacca 2700 

ccaccactaa cccgggacca agtttactca tatatacttt agattgattt aaaacttcat 2760 

ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 2820 

taacggcatg caccattcct tgcggcggcg gtgctcaacg gcctcaacct actactgggc 2880 

tgcttcctaa tgcaggagtc gcataaggga gagcgtctgg cgaaaggggg atgtgctgca 2940 

aggcgattaa gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc 3000 
agtgccaagc ttaaaaaaaa tccttagctt tcgctaagga tctgcagtta taatctcttt 3060
ctaattggct ctaaaatctt tataagttct tcagctacag cattttttaa atccattgga 3120
tgcaattcct tatttttaaa taaactctct aactcctcat agctattaac tgtcaaatct 3180
ccaccaaatt tttctggcct ttttatggtt aaaggatatt caaggaagta tttagctatc 3240
tccattattg gatttccttc aacaactcca gctgggcagt atgctttctt tatcttagcc 3300
ctaatctctt ctggagagtc atcaacagct ataaaattcc cttttgaaga actcatcttt 3360
ccttctccat ccaaacccgt taagacaggg ttgtgaatac aaacaacctt ttttggtaaa 3420
agctcccttg ctaacatgtg tatttttctc tgctccatcc ctccaactgc aacatcaacg 3480
cctaaataat gaatatcatt aacctgcatt attggataga taacttcagc aacctttgga 3540
ttttcatcct ctcttgctat aagttccata ctccttcttg ctctttttaa ggtagttttt 3600
aaagccaatc tatagacatt cagtgtataa tccttatcaa gctggaattc agcgttacaa 3660
gtattacaca aagtttttta tgttgagaat atttttttga tggggcgcca cttatttttg 3720
atcgttcgct caaagaagcg gcgccagggn tgtttttctt ttcaccagtn agacgggcaa 3780
cagaacgcca tgagcggcct catttcttat tctgagttac aacagtccgc accgctgtcc 3840
ggtagctcct tccggtgggc gcggggcatg actatcgtcg ccgcacttat gactgtcttc 3900
tttatcatgc aactcgtagg acaggtgccg gcagcgccca acagtccccc ggccacgggg 3960
cctgccacca tacccacgcc gaaacaagcg ccctgcacca ttatgttccg gatctgcatc 4020
gcaggatgct gctggctacc ctgtggaaca cctacatctg tattaacgaa gcgctaaccg 4080
tttttatcag gctctgggag gcagaataaa tgatcatatc gtcaattatt acctccacgg 4140
ggagagcctg agcaaactgg cctcaggcat ttgagaagca cacggtcaca ctgcttccgg 4200
tagtcaataa accggtaaac cagcaataga cataagcggc tatttaacga ccctgccctg 4260
aaccgacgac cgggtcgaat ttgctttcga atttctgcca ttcatccgct tattatcact 4320
tattcaggcg tagcaccagg cgtttaaggg caccaataac tgccttaaaa aaattacgcc 4380
ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc gacatggaag 4440
ccatcacaga cggcatgatg aacctgaatc gccagcggca tcagcacctt gtcgccttgc 4500
gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt ggccacgttt 4560
aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat attctcaata 4620
aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg cgaatatatg 4680
tgtagaaact gccggaaatc gtcgtggtat tcactccaga gcgatgaaaa cgtttcagtt 4740
tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag ctcaccgtct 4800
ttcattgcca tacg 4814



<210> 33
<211> 4338
<212> DNA
<213> Artificial

<220>
<223>

<220>
<221> misc_feature
<222> (3274)..(3274)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (3294)..(3294)
<223> n is a, c, g, or t

<400> 33
gaactccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600
ggcttactat gttggcactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 660
aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 720
actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 780
ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 840
agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 900
agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 960
tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1020
gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1080
tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1140
gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1200
agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1260
tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1320
cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1380
aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1440
atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1500
atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1560
aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1620
tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1680
atatcggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt 1740
tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa 1800
atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt 1860
attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac actagtgcag 1920
caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 1980
aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 2040
ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 2100
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 2160
ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 2220
ttaagcattg gtaacccggg accaagttta ctcatatata ctttagattg atttaaaact 2280
tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat 2340
cccttaacgg catgcaccat tccttgcggc ggcggtgctc aacggcctca acctactact 2400
gggctgcttc ctaatgcagg agtcgcataa gggagagcgt ctggcgaaag ggggatgtgc 2460
tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 2520
ggccagtgcc aagcttaaaa aaaatcctta gctttcgcta aggatctgca gttataatct 2580
ctttctaatt ggctctaaaa tctttataag ttcttcagct acagcatttt ttaaatccat 2640
tggatgcaat tccttatttt taaataaact ctctaactcc tcatagctat taactgtcaa 2700
atctccacca aatttttctg gcctttttat ggttaaagga tattcaagga agtatttagc 2760
tatctccatt attggatttc cttcaacaac tccagctggg cagtatgctt tctttatctt 2820
agccctaatc tcttctggag agtcatcaac agctataaaa ttcccttttg aagaactcat 2880
ctttccttct ccatccaaac ccgttaagac agggttgtga atacaaacaa ccttttttgg 2940
taaaagctcc cttgctaaca tgtgtatttt tctctgctcc atccctccaa ctgcaacatc 3000
aacgcctaaa taatgaatat cattaacctg cattattgga tagataactt cagcaacctt 3060
tggattttca tcctctcttg ctataagttc catactcctt cttgctcttt ttaaggtagt 3120
ttttaaagcc aatctataga cattcagtgt ataatcctta tcaagctgga attcagcgtt 3180
acaagtatta cacaaagttt tttatgttga gaatattttt ttgatggggc gccacttatt 3240
tttgatcgtt cgctcaaaga agcggcgcca gggntgtttt tcttttcacc agtnagacgg 3300
gcaacagaac gccatgagcg gcctcatttc ttattctgag ttacaacagt ccgcaccgct 3360
gtccggtagc tccttccggt gggcgcgggg catgactatc gtcgccgcac ttatgactgt 3420
cttctttatc atgcaactcg taggacaggt gccggcagcg cccaacagtc ccccggccac 3480
ggggcctgcc accataccca cgccgaaaca agcgccctgc accattatgt tccggatctg 3540
catcgcagga tgctgctggc taccctgtgg aacacctaca tctgtattaa cgaagcgcta 3600
accgttttta tcaggctctg ggaggcagaa taaatgatca tatcgtcaat tattacctcc 3660
acggggagag cctgagcaaa ctggcctcag gcatttgaga agcacacggt cacactgctt 3720
ccggtagtca ataaaccggt aaaccagcaa tagacataag cggctattta acgaccctgc 3780
cctgaaccga cgaccgggtc gaatttgctt tcgaatttct gccattcatc cgcttattat 3840
cacttattca ggcgtagcac caggcgttta agggcaccaa taactgcctt aaaaaaatta 3900
cgccccgccc tgccactcat cgcagtactg ttgtaattca ttaagcattc tgccgacatg 3960

gaagccatca cagacggcat gatgaacctg aatcgccagc ggcatcagca ccttgtcgcc 4020 

ttgcgtataa tatttgccca tggtgaaaac gggggcgaag aagttgtcca tattggccac 4080 

gtttaaatca aaactggtga aactcaccca gggattggct gagacgaaaa acatattctc 4140 

aataaaccct ttagggaaat aggccaggtt ttcaccgtaa cacgccacat cttgcgaata 4200 

tatgtgtaga aactgccgga aatcgtcgtg gtattcactc cagagcgatg aaaacgtttc 4260 

agtttgctca tggaaaacgg tgtaacaagg gtgaacacta tcccatatca ccagctcacc 4320 

gtctttcatt gccatacg 4338 



<210> 34 

<211> 2271 

<212> DNA 

<213> Artificial 

<220> 

<223> plasmid pKQ 

<400> 34 

atggatccga gctcgagatc tgcagctggt accatatggg aattcgaagc ttgggcccga 60 

acaaaaactc atctcagaag aggatctgaa tagcgccgtc gaccatcatc atcatcatca 120 

ttgagtttaa acggtctcca gcttggctgt tttggcggat gagagaagat tttcagcctg 180 

atacagatta aatcagaacg cagaagcggt ctgataaaac agaatttgcc tggcggcagt 240 

agcgcggtgg tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat 300 

ggtagtgtgg ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa 360 

ggctcagtcg aaagactggg cctttcgttt tatctgttgt ttgtcggtga acgatatctg 420 

cttttcttcg cgaattaatt ccgcttcgca acatgtgagc aaaaggccag caaaaggcca 480 

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 540 

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 600 

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 660 

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta 720 

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 780 

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 840 

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 900 

gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat 960 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 1020 

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 1080 

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 1140 

ggaacgaaaa ctcacgttaa gggattttgg tcatgagttg tgtctcaaaa tctctgatgt 1200 

tacattgcac aagataaaaa tatatcatca tgaacaataa aactgtctgc ttacataaac 1260 

agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcttgctc gaggccgcga 1320 

ttaaattcca acatggatgc tgatttatat gggtataaat gggctcgcga taatgtcggg 1380 

caatcaggtg cgacaatcta tcgattgtat gggaagcccg atgcgccaga gttgtttctg 1440 

aaacatggca aaggtagcgt tgccaatgat gttacagatg agatggtcag actaaactgg 1500 

ctgacggaat ttatgcctct tccgaccatc aagcatttta tccgtactcc tgatgatgca 1560 

tggttactca ccactgcgat ccccgggaaa acagcattcc aggtattaga agaatatcct 1620 

gattcaggtg aaaatattgt tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt 1680 

cctgtttgta attgtccttt taacagcgat cgcgtatttc gtctcgctca ggcgcaatca 1740 

cgaatgaata acggtttggt tgatgcgagt gattttgatg acgagcgtaa tggctggcct 1800 

gttgaacaag tctggaaaga aatgcataag cttttgccat tctcaccgga ttcagtcgtc 1860 

actcatggtg atttctcact tgataacctt atttttgacg aggggaaatt aataggttgt 1920 

attgatgttg gacgagtcgg aatcgcagac cgataccagg atcttgccat cctatggaac 1980 

tgcctcggtg agttttctcc ttcattacag aaacggcttt ttcaaaaata tggtattgat 2040 

aatcctgata tgaataaatt gcagtttcat ttgatgctcg atgagttttt ctaatcagaa 2100 

ttggttaatt ggttgtaaca ctggcagagc attacgctga cttgacggga cggcggcttt 2160 

gttgaataaa tcgaactttt gctgagttga aggatcctcg ggagttgtca gcctgtcccg 2220 

cttataagat catacgccgt tatacgttgt ttacgctttg aggaattaac c 2271 

<210> 35 
<211> 4582 
<212> DNA 
<213> Artificial 

<220> 

<223> plasmid pKQ-PhKep 

<400> 35 

atggttcatt gggccgatta tattgctgat aaaataatta gagagagggg ggagaaggag 60 

aagtacgttg ttgagagtgg aataacgcca agtggttacg ttcacgttgg gaactttagg 120 

gagcttttta cagcttatat tgtgggccat gccctaaggg ataaggggta tgaggttagg 180 

cacatccaca tgtgggatga ttatgataga tttaggaagg ttccaaggaa cgttccccag 240 

gaatggaaag attacctggg aatgcccatt agtgaagttc ctgatccctg gggatgccat 300 

gagagttatg ctgaacactt catgagaaag ttcgaggagg aggtagaaaa attagggatc 360 
gaagttgact ttctttatgc gagtgaactc tacaagagag gggaatattc tgaggagata 420 

aggttagcct ttgagaaaag ggataagata atggagatac taaacaagta tagggaaatt 480 

gcgaaacaac ctccccttcc agagaactgg tggcccgcaa tggtttactg ccctgagcat 540 

aggagggaag cagagatcat tgaatgggat gggggctgga aggttaagta taagtgcccc 600 

gaaggtcacg agggatgggt tgatataagg agtgggaacg tgaaactgag gtggcgtgtt 660 

gattggccca tgcgttggtc tcactttggc gttgacttcg aacctgctgg aaaggatcat 720 

cttgtggctg gttcaagcta cgatacggga aaggagatta taaaggaagt ttatggaaag 780 

gaagctccgt tatctttaat gtatgagttt gttggaatta aggggcagaa ggggaagatg 840 

agtggtagta agggaaatgt tattttactc agcgatctgt atgaggttct tgagccaggt 900 

ctcgttagat ttatctacgc tcggcatagg ccaaacaagg agataaagat agatctaggt 960 

cttggcattc taaacctcta cgatgagttc gataaagttg agagaatata cttcggggtt 1020 

gagggtggta aaggtgatga tgaagaatta aggaggactt acgagctttc ggtgatgctg 1080 

ccaacttact gatttagtgt atgatggtgt ttttgaggtg ctccagtggc ttctgtttct 1140 

atcagctgtc cctcctgttc agctactgac ggggtggtgc gtaacggcaa aagcaccgcc 1200 

ggacatcagc gctatctctg ctctcactgc cgtaaaacat ggcaactgca gttcacttac 1260 

accgcttctc aacccggtac gcaccagaaa atcattgata tggccatgaa tggcgttgga 1320 

tgccgggcaa ccgcccgcat tatgggcgtt ggcctcaaca cgattttccg ccatttaaaa 1380 

aactcaggcc gcagtcggta acctcgcgca tacagccggg cagtgacgtc atcgtctgcg 1440 

cggaaatgga cgaacagtgg ggatacgtcg gtgctaaatc gcgccagcgc tggctgtttt 1500 

acgcgtatga caggctccgg aagacggttg ttgcgcacgt attcggtgaa cgcactatgg 1560 

cgacgctggg gcgtcttatg agcctgctgt caccctttga cgtggtgata tggatgacgg 1620 

atggctggcc gctgtatgaa tcccgcctga agggaaagct gcacgtaatc agcaagcgat 1680 

atacgcagcg aattgagcgg cataacctga atctgaggca gcacctggca cggctgggac 1740 

ggaagtcgct gtcgttctca aaatcggtgg agctgcatga caaagtcatc gggcattatc 1800 

tgaacataaa acactatcaa taagttggag tcattacccc gagctttcaa tgcctaagaa 1860 

gcctgagaga ttagtcgctc aagctccttt taggttccta gcggtgttgg ttcagttacc 1920 

gcatttaacc gaagaagaca taataaatgt tctaatcaaa cagggacata ttcccaggga 1980 

tctatccaag gaggacgttg agagggttaa acttaggata aaccttgcta ggaattgggt 2040 

taaaaagtat gcccctgagg atgttaaatt ctcaatactt gagaaacctc cagaagttga 2100 

ggtaagtgaa gatgttaggg aggccatgaa tgaggttgct gagtggcttg agaatcatga 2160 

ggaatttagc gttgaagagt ttaataacat tctattcgaa gttgccaaga ggagggggat 2220 

atccagtagg gagtggtttt cgacgctcta cagattattt attggaaagg aaaggggacc 2280 

gagattggcc agtttcctgg catctcttga taggagtttc gttattaaac gacttagact 2340 

tgagggataa gaattcgaag cttgggcccg aacaaaaact catctcagaa gaggatctga 2400 

atagcgccgt cgaccatcat catcatcatc attgagttta aacggtctcc agcttggctg 2460 

ttttggcgga tgagagaaga ttttcagcct gatacagatt aaatcagaac gcagaagcgg 2520 

tctgataaaa cagaatttgc ctggcggcag tagcgcggtg gtcccacctg accccatgcc 2580 

gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc atgcgagagt 2640 

agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt 2700 

ttatctgttg tttgtcggtg aacgatatct gcttttcttc gcgaattaat tccgcttcgc 2760 

aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg 2820 

tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg 2880 

tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg 2940 

cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga 3000 

agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc 3060 

tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt 3120 

aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact 3180 

ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg 3240 

cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt 3300 

accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt 3360 

ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct 3420 

ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg 3480 

gtcatgagtt gtgtctcaaa atctctgatg ttacattgca caagataaaa atatatcatc 3540 

atgaacaata aaactgtctg cttacataaa cagtaataca aggggtgtta tgagccatat 3600 

tcaacgggaa acgtcttgct cgaggccgcg attaaattcc aacatggatg ctgatttata 3660 

tgggtataaa tgggctcgcg ataatgtcgg gcaatcaggt gcgacaatct atcgattgta 3720 

tgggaagccc gatgcgccag agttgtttct gaaacatggc aaaggtagcg ttgccaatga 3780 

tgttacagat gagatggtca gactaaactg gctgacggaa tttatgcctc ttccgaccat 3840 

caagcatttt atccgtactc ctgatgatgc atggttactc accactgcga tccccgggaa 3900 

aacagcattc caggtattag aagaatatcc tgattcaggt gaaaatattg ttgatgcgct 3960 

ggcagtgttc ctgcgccggt tgcattcgat tcctgtttgt aattgtcctt ttaacagcga 4020 

tcgcgtattt cgtctcgctc aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag 4080 


tgattttgat gacgagcgta atggctggcc tgttgaacaa gtctggaaag aaatgcataa 4140 

gcttttgcca ttctcaccgg attcagtcgt cactcatggt gatttctcac ttgataacct 4200 

tatttttgac gaggggaaat taataggttg tattgatgtt ggacgagtcg gaatcgcaga 4260 

ccgataccag gatcttgcca tcctatggaa ctgcctcggt gagttttctc cttcattaca 4320 

gaaacggctt tttcaaaaat atggtattga taatcctgat atgaataaat tgcagtttca 4380 
tttgatgctc gatgagtttt tctaatcaga attggttaat tggttgtaac actggcagag 4440 

cattacgctg acttgacggg acggcggctt tgttgaataa atcgaacttt tgctgagttg 4500 

aaggatcctc gggagttgtc agcctgtccc gcttataaga tcatacgccg ttatacgttg 4560 

tttacgcttt gaggaattaa cc 4582 

<210> 36 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oligonucleotide primer 

<400> 36 
cagtggaatt cagtaagttg gcagcatcac 30 

<210> 37 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> oligonucleotide primer 

<400> 37 
cattggaatt cgagtaagtt ggcagcatca c 31 